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1. Introduction 

In their recommendations for future extensions of Item Response Theory 
(IRT; see, e.g., Lord, 1980), Traub and Wolfe (1981) suggest that a 
"way of improving the linkage of achievement measurement and 
instruction is to obtain detailed data about the distribution (by student, 
class, school) of instruction and incorporate that inforr^ation into the 
response model. We are thinking here of the so-called 'opportunity-to- 
^earn' measures used in the lEA surveys and the exposure measures 
used, for example, by Fischer (1972)" (pp. 422-423). This paper 
represents an attempt in that direction. 

Recently, one can observe an increasing concern about the match between 
the school curriculum and what is being tested by standardized 
achievement tests. See for iritance Airasian and Madaus (1983), 
Haertle and Calfee (1983), Mehrens and Phillips (1986), Miller (1986). 
An interesting development is the use of "opportunity-to-learn" measures 
(Anderson, 1985), see e.g. Miller and Linn (1986) and Engelhard 
(1986). 

This paper discusses methodological implications of utilizing 
instructional information in combination with the usual item responses. 
Models that expand those of standard Item Response Theory (IRT) will 
be considered. As an illustration, we will consider the achieverr.3nt 
data of 



the U.S. sample of the Second International Mathematics Study (SIMS), 
Crosswhite, Dossey, Swafford, McKnight, and Cooney (1985). In SIMS, 
opportunity-to- learn (OTL) response was gathered tor both teachers and 
students corresponding to each item. Their response concerned whether 
or not the mathematics needed was covered during the present school 
year, and if not, whether it had been covered in prior years. 

When analyzing achievement data from students with varying 
instructional background, the modeling should recognize that a 
heterogeneous population is at hand. Three methodological issues are of 
particular interest: 

(1) How should the modeling capture the fact that the measurement 
relationship between the items and the latent trait may vary over 
students ? 

(2) How should trait dimensionality be assessed ? 

(3) How should the latent trait values be estimated ? 

In IRT analyses of standardized achievpr.ient tests, it is assumed that 
instruction increases the item performance through an increase in the 
latent trait level, while the item-trait relationship remains the same; 
hence, no "item bias". This may be too strong an assumption when the 
instruction 



is geared towards certain types of items in the test. If the assumption 
IS incorrect and biased items are not removed , biased latent trait 
estimates are obtained. Furthermore, the factorial structure may not 
be the same for a group with high coverage as for a group with low 
coverage; indeed, with very low coverage, the validity of the item is 
called in question. 

In this paper we will concentrate of the first question. It will be 
formulated as a problem of assessing "item bias", or instructional 
sensitivity in the items , when item specific OTL type information is 
available. In section 2 approaches that use traditional IRT methodology 
to assess measurement differences will be considered. He-e, groups of 
students are formed based on OTL score and differences in item 
characteristic curves are assessed after equating one group's 
measurement parameters to the other group's. Section 3 demonstrates 
the v/eakness of this item bias approach by means of an artificial data 
analysis. In section 4 a new method is proposed which does not 
necessitate the creation of groups to assess item bias and avoids the 
problem of the standard item bias detection approach. This method 
generalizes IRT modeling to allow for item specific variation in 
measurement relations, across students with varying instructional 
background (OTL). Item bias detection is obtained as a by-product. 
Section 5 applies the traditional and the new methods to the SIMS data 
and compares the outcomes in terms of item response curve bias. 



2. Traditional item b^as detection 



With the availability of auxiliary information such as OTL» a 
conventional item bias type analysis is naturally of interest. Groups 
are formed based on some criterion related to OTL. IRT estimation is 
carried out in each group, the parameter estimates are made 
comparable by some form of equating, and some form of item bias 
index is calculated for each item (see e.g.. Linn, Levine, Hastings, and 
Wardrop, 1981). In the present situation, however, it should be 
recognized that the auxiliary information of OTL is item specific. 
Whereas race or gender information puts a person in a group which is 
constant over the items, the OTL "group membership" varies with item. 
The problem is then how groups should be formed-or should they? 

One approach to overcome the item specific nature of the OTL measure 
IS to put in the same group students with the same OTL value ("Yes", 
"No", or graded in some way) for all items in some subset of the whole 
test and to study item response curve differences for this subset. Even 
with a very small number of items, however^ this is likely to lead to 
groups with small sample sizes, jeopardizing the reliability of the IRT 
estimation. The analysis would also ignore a large portion of the total 
sample available. 

Miller and Linn (1986) used an interesting alteniativc in analyzing 
SIMS data. Teacher reported OTL measures for each item were first 



factor analyzed, and groups of students were then fornned by cluster 
analysis base^d on the corresponding factor scores* The factors 
corresponded to groupings of itenns by content, such as the two for 
algebra: (1) formula, algebraic expressions, equalities and 
inequalities, an ' (2) signed number equations. This gave "curriculum 
clusters" that were well separated with respect to mean values of the 
factor variables. On comparison between item response curves for the 
same item across student groups, they found several items with 
considerable differences, i.e. evidence of "instructional bias" (see Linn 
& Hamisch, 1981). Miller and Linn (1986) noted that the magnitude of 
the biases were frequently larger Xhan commonly eoncountered when 
considering student groups formed by ethnicity. 

This approach has considerable merit and gave revealing results in 
terms of item response curve differences across clusters. It has, 
however, the drawback of basing the estimation of an item^s paramaters 
in a certain group (cluster) on students that may well Iiave a wide range 
of OTL factor values. The approach also has the disadvantage that a 
student in any given group, who has a certain OTL status for an item, 
tends to have the same OTL status for several other items. To solve 
this item, the student may then be at an added advantage or disadvantage 
due to presence or absence of training in related topics. 

There is, however, a general problem with item bias detection methods 
of the above type that woula seem to make them inappropriate for 
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situations of varying instructional coverage- As we will demonstrate in 
the next section, this is because such situations may often be 
characterized as involving groups for which many or most of the items 
may be biased. 

3. The problem of traditional item bias detection 

In this section an IRT model with hypothetical population values will be 
described. A traditional item bias detection scheme will be applied to 
this hypothetical model. 

Consider the situation of a set of p items y that measure a single latent 
trait rj. Assume that a two-parameter normal ogive model holds for the 
items (cf. Lord, 1980), 



where t is the standard normal distribution function, and a and b are the 
usual discrimination and difficulty parameters. Conditional 
independence is assumed as usual. 

The model above will be used to illustrate tfie problem of ordinary IRT 
bias detection in situations where the groups to be compared have 
different levels of instructional coverage or opportunity-to-learn. 
Consider for Simplicity two groups, group one having latent trait mean 



(1) P (y = 1 I rj) =(D [a (r)-b ) ] 
J J J 



; j -1, 2, ...p 
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0 and variance 1, and group 2 having mean 1 and variance 0,5, This 
may represent the situation of group 2 having had more OTL for the set 
of items at hand than group 1, so that students of this group both have a 
higher trait level and are more homogeneous with respect to this trait. 
The two groups will be referred to as the low and the high OTL group. 
Consider 40 items and five different situations of varying degree of 
bias. 

(1) Zero bias. None of the items are biased, i.e. for each item the 
same measurement parameters a, b hold in both groups. The item 
parameters vary over items as follows for item 1-10 (a, b): 
item 1 and 2: 0.98, -2.86; item 3 and 4: 0.98, -1.43; item 5 and 6: 
0.98, 0.00; item 7 and 8: 0.98, 1.43; item 9 and 10: 0.98,2.86. 

These same parameter values are used for each of four sets of ten 
items in the total set of 40 items. 

(2) 25 % bias. One of four sets of ten items shows bias, while tfie 
other three do not. For the ten biased items, the low OTL group is 
viewed as net having had sufficient instructional coverage in that the 
difficulty of each of the 10 items is perceived as higher than for the 
high OTL group. For the remaining three sets of ten items, the groups 
of students do not differ in OTL. This is captured by different b values 
for the two groups, while the a values remain equal. The group 1 (low 
OTL) b values are increased by 1.14 yielding the b values: 



er|c 



item 1 and 2: -1.72; item 3 and 4: -.29; item 5 and 6: 1.14; item 7 
and 8: 2.57; item 9 and 10: 4.00; 

(3) 50 % bias. Here, a situation similar to that of 25 % bias is 
considered, but twenty of the items are biased. The bias is the same as 
for 25 % bias except that it is applied to two sets cf ten items. 

(4) 75 % bias. As =ibove, but with three sets of ten biased items. 

(5) 100 % bias As above, but with all four sets of ten items being 
biased. 

Using the hypothetical measurement parameter values above, we may 
study the application of a common item bias detection scheme. The 
first step involves expressing the item parameters, using the a, b 
parameterization, in a metric that corresponds to a latent trait mean 
and variance of 0 and 1 . This metric corresponds to the one obtained 
for estimates obtained by standard IRT analyses. While the parameters 
for the low OTL group 1 then remain unchanged, the high OTL group 2 
values need to be scaled. 

The second step involves item response curve bias calculation, for 
instance in the simple way described in e.g. Linn et al (1981); see also 
Lord (1980). The mean and variance of the difficulty values are used 
to equate group 2 
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measurement parameter values to group 1 values. The item bias for 
each item is then expressed simply as the square root of the sum of 
squared differences between the item response curves of (1), summing 
at steps cf 0.1 from 3 to 3 (see Linn et al, 1981). Note that we may 
also calculate the true bias value for each item. This is obtained by 
calculating the value based on the original a and b value for each item. 
Linn et al (1981) describe a response curve bias value of 0.2 or larger 
as possibly of practical importance. 

Table 1 gives the results for the hypothetical bias cases of 25 % - 100 % 
bias. The case of no bias would simply show that the true and 
calculated bias values are zero for all items. 



Insert Table 1 here 



We note that for the cases of 25, 50 and 75 %, the difference between 
calculated and true bias increases with increasing bias proportion. 
While for 25 %, the calculated bias values are rather close to the true 
ones, large errors are observed for 50 % and 75 %. In all cases the 
cal^'jlated bias values seem to indicate bias where there is none, and 
underestimate bias where it existc. This reflects the fact that the 
detection technique operates under the assumption that no items are 
biasad and works reasonably well only when a small proportion uf items 
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deviate from this assumption. The fact that ihe technique makes bias 
decisions relative to the average it^^m is clearly shown in the case of 
100 % bias, where no bias is found. The fact that all items are biased 
is mistaken by this technique as an indication of a group difference in 
'^^'t distribution. 

We conclude that with strong differences in OTL between students it is 
quile possible that many or most items are biased, in which case the 
traditional detection technique is inappropriate. 

4. A new IRT extension incorporating OTL 

We will present a solution that avoids tfie problems of the 
traditional item bias detection scheme. Compared to such schemes, the 
rievv r:ipproach will ^^void the difficulty of forming groups prior to IRT 
estimatjon and will also avoid the need for the equating step. This is 
achieved by allowing the difficulty parameter for each item to vary with 
the OTL level. In this way, item specific variation in "group 
membership" is allowed for and the population heterogeneity is taken 
into account in the model specification. 

Assume the availability of OTL information for each of a set of p 
items y .. Let the OTL variable connected with item i be denoted x . . 
Let y be a p-vector of continuous latent response variables, such that 
for item j 
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(2) y. = 0, if y'j ^ r . 

1, otfierwise 



where is a threshold parameter defined Y y 
For the p- vectors and x, asoume 

(3) X r) + B X + t , 

(4) = y' X + C , 
Yielding 

(5) y"^ = (A y' + B) X + X ^ + e , 

where X is a p - vector of measurement slopes, q is the latent trait, B 
is a diagonal p x p matrix of slopes reflecting strength of influence 
of each x on the level of the corresponding y*, 6 is a p - vector of 
measurement errors with 7^ro expectation , y is a p - vector of 
structural slopes describing the influence of the x's on the trait, and ^ 
is a residual with zero expectation. It is assumed that y* conditional 
on X has a multivariate normal distribution. Assume further that € and 
^ are independent of each other and of x, and that e is also independent 
of q. 
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Let V (p = I// arxJ V (e) = 6, where G is diagonal. Due to normality it 
suffices to consider 

(6) E (y* I X) = (X y' X + B) X , 

(7) V (y* I X) = A »^ X' + e , 

where we may standardize to unit conditional y variances, yielding 

2 

diagonal 6 elements e. . = 1 - X. \// . Equations (6) and (7) describe 

the structure that the model imposes on t^e multivariate regression of 
★ 

y on X* Since the y's are dichotomous, tfie model may be termed a 
multivariate structural probit model. The model imposes restrictions 
on the p X p slopes and on the p(p-l)/2 residual correlations of an 
"unrestricted" multivariate probit regression model. 

Note that 

(8) E (y* . I r), X) = X . r) -h . X . , 

(9) V (y* . I r), X) = 6 . . , 

so that by standard results on conditional means and variances, 

(10) E (y*. In) =Xj r) + )5j E (x.) , 
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(11) V (y*^ I q) = ©jj + V (x^) . 

If is normal, the distribution of y* conditional on r) is normal and 
we have 

(12) P (y =11 r)) = $ ( [-r + E (y* I rj) ] [ V (y*, I r,) j'^} 

^ J J J 

so thai the standard normal ogive IRT parameters are obtained as 

(13, = .^j [ - V V (xj, 

(14) b =lr, E(x.)lX-'. 

J J J sj 

First note that for ^^s = 0, or no OTL x's present, this is the standard 

two-parameter normal ogive IRT model. With OTL x's present, each . 

J 

is presumably positive. In standard IRT, incorrectly ignoring the OTL 
information (or assuming ^. = 0), (13) and (14) show that we obtain 
non-invariance of the standard item parameters whon a certain set of 
items is administered to populations with varying OTL distribution; 
populations with larger OTL variance and higher OTL mean tend 
to have lower item discrimination and lower item difficulty, 
respectively. The present extended IRT model avoids such problems by 
incorporating this item non-invariance directly into the model. Note 
that the measurement parameters of r and X may still be invariant. 
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If X . is not normal, P (y . =1 1 r)) is no longer a nornnal ogive, altPx)ugh 
it does represent a monotonically increasing curve. We shall be 
particularly interested in cases where x . is dichotomous, denoting 
presence or absence of sufficient OTL according to some subjective 
criterion. In such cases, it is useful to consider the normal ogives at 
the two different x . values 1 and 0. Using (8) and (9), we then have 

(15) aj - X. e.j ^ , 

if 

From (8) and (16) we note that the j5. difference in conditional y . 

3 3 
mean at x . = 1 versus x . = 0 may alternatively be seen as a difference 

in item difficulty; item j is perceived in two dii'ferent versions, with 

and without OTL. 

We will show that this model formulation is a special case of a general 
structural model proposed by Muthen (1984). This general model 
extends traditional structural equation modeling with continuous 
variables to situations with categorical and other non-normal 
measurements, such as the dichotomous ones here. A simplified 
version of Muthen's general model assumes (using notation similar to 
that of (2) and (3)) 
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yielding 



(19) E (y*I X) = Ag (I-Bg)'VgX, 

(20) V {y* I X) = Ag (I - Bg)-' *g (I - Eg)'"' \^^g' 

Here, the subscript g is used to denote quantities corresponding to the 
general model of Muthen (1984) as opposed to the specific model 
prop sed above. Where quantities are the same, the subscript has been 
omitted. 



To see that the proposed model fits into the general framework, let 
= (y*% n) '?g' " ^ f'' 0^ and let 

(21) / = Ag,^, 
With 



'23' ^= ['p,p 0], 
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On V n ^ 

(25) B = [ P ^ P ] , 
g 0' 0 




(26) = [ ^, ] , 




^ 0 if, 




We may note that since the y variables are dichotomous, the diagonal 




elements of V (y I x ) may again be standardized to unity. This means 




that only the off-diagonal elements of V (y* I x), and therefore 6, enter 




into the analysis and that the diagonal elements of 6 may be fixed to any 




value. 




For the statistical background of this technique, the reader is referred 




to Muthen (1984) . Estimation is carried out by limited information 




generalized least squares, and a large sample chi-square test of model 




fit as well as standard errors of estimates are provided. Parameters 




may be of three kinds: free to be estimated, fixed to a certain value. 




and constrained to be equal to other parameters. The analyses to be 




presented have been carried by the LISCOMP program, which builds on 




the theory of Muthen (1984), see Muthen (1987). 




It may be noted that when needed, the proposed model may be easily 




generalized in the model framework of Muthen (1984). For instance, 




the B matrix need not have all off-diagonal elements fixed at zero, the 




0 matrix need not be diagonal, and there may be more than one q. 
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5. Applications 



For illustrative purposes, consider now the application of both ^he 
traditional bias detection technique and the proposed approach to some 
achievement items from the Second International Mothematics Study 
(SIMS). We vill analyze a set of eight dichotomously scored algebra 
core items described in Table 2. 



Insert Table 2 here 



The sample consists of 4,129 U S. eighth gr-^de students (Crosswhite, 
Dossey, Swafford, McKnight, and Cooney, 1985). 

5.1 Traditional IRT bias detection 

Consider first LISCOMP estimation of the standard two-parameter 
normal ogive IRT model of section 4 with no x's present, adding the 
assumption of ? normally distributed trait q. Similar estimates would 
be obtained by standard IRT analysis. LISCOMP gives a large sample 
chi-square test cf model fit and allows for violations of the conditional 
independence assumption in the form of correlated e residuals. The 
standard model of uncorrected residuals resulted in a chi-square of 
61.6 with 20 degrees of freedom. The number of degrees of freedom is 
the number of restrictions imposed on the correlations among the y*'s 
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(see e.g. Muthen, 1978). Relaxing the model restrictions somewhat, a 
strong improvement in fit was obtained when allowing the residuals for 
items 5 and 7 to correlate-, resulting in a chi-square of 45.^ with 19 
degrees of freedom. Given the large sample size, this is regarded as a 
satisfactory fit. 

In line with the student grouping approaches discussed in Section 2, this 
model was then used for groups based both on OTL and on type of 
mathematics class. The OTL measures were obtained from the 
teachers, where for each item the teacher responded to the question: 
"During this school year did you teach or review the mathematics needed 
to answer the item correctly ?" 

The answers were No (scored 0) and Yes (scored 1). A similar 
question was directed to the students. There is clearly a question of 
reliability of both these reports. The student response may be affected 
by the perceived difficulty of the item, and the teacher response 
concerns the class as a whole, where a claim of coverage may be 
irrelevant for the student who was absent. We have chosen to work with 
the teacher response since we feel it may be the least unreliable; the 
fact that this measurement is not on the student level is here ignored. 

A low and a high OTL group was created by splitting the students based 
on the sum of the item OTL scores at ^ 6 versus higher, resulting in 
sample sizes of 2,101 versus 2,028. As an alternative, students were 
also divided into two groups based on type of mathematics class. 
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Remedial and Typical chsses were contrasted with Enriched and 
Algebra classes, yielding 2,592 versus 1,537 students. 

In the low OTL group, the model with 19 degrees of freedom obtained a 
chi-square value of 26.6, while in the high OTL group, 47.9 was 
obtained. For the "low" class types, the chi-square 30.2 was obtained, 
whereas the "high" class types obtained the value 36.5. The estimation 
was carried out with trait mean of zero and trait variance one. The 
estimated r and A values can be translated to the IRT a and b values by 
setting jS = 0 in (13) and (14). The response curve bias index may then 
be computed as usual. 

The left-most part of Table 3 ("Troditional") gives the resulting item 
bias values for each item given the two ways of dividing students into 
groups. These results are given both for the standard model with 
uncorrelated residuals (Model I; 20 d.f.) and the model allowing the free 
residual correlation (Model II; 19 d.f.). 



Insert Table 3 here 



We note that the least amount of bias is observed for the first three 
items. There seems to be little difference between bias values 
calculated from Model I versus Model II, and using the two ways of 
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creating the student groups. 



5.2 Applying the new IRT approach to the SIMS data 

Applying the new approach to the SIMS algebra core items, a chi-square 
test of model fit gave the value 223.5 with 68 degrees of freedom. The 
number of degrees of freedom is obtained as the total number of 
restrictions imposed on the p x p regression slopes of E (y* I x) and the 
p(p-l)/2 residual correlations of V {y* I x) (see Muthen, 1984). A 
strong improvement in fit was obtained when allowing the residuals for 
items 5 and 7 to correlate and a further improvement was obtained 
when allowing correlation between the residuals for items 6 and 8. 
For simplicity, we chose as our final model the one with only errors 5 
and 7 free to correlate, resulting in a chi-square with 208.8 with 67 
degrees of freedom. Let this model be denoted Model IIL The 
estimates from Model III are given in Table 4. 



Insert Table 4 here 



The most interesting result concerns the estimated fi values on the 
diagonal of B representing tfie effect of each of the OTL variables x on 
the corresponding response variable. Note that this is an effect over and 
above that of r), so that we are describing the effect of OTL for given 
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achievement trait value. For the first four items we have strong 
positive effects of OTL while the last four exhibit insignificant OTL 
effects. 

The estimated ^ values and their standard errors give a succinct way of 
assessing item bias, or instructional sensitivity, in each item. 
However, for comparison it may be of interest to study the 
corresponding item response curve bias values. These may be computed 
from the estimated Model III by (15) and (16). The bias values are 
given in Table 3 in the Model III column. We note that the results 
contradict those for Model I and Model II using the traditional approach 
of dividing the students into groups. The difference is particularly 
strong for the last four items. The difference is possibly due to the 
deficiency of the traditional approach discussed in section 3. 
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TABLE 1 

Item Bias In Artificial Data 



(Entries are the square root of the sun of squared response curve difference) 



25% Bias 


1 


50% 


Bias 






Biased 


Set 


(10 Items) 




Biased Sets (2 x 


10 Items) 




Calculated 


True 






Calculated 


True 




Iten 


Bias 


Bias 


ui r lerence 


Item 


Bias 


Bias 


Difference 




.32 


.39 


. U r 


1.2 


.24 


.39 


-.15 


3,4 


.35 


.45 


— in 


3,4 


.25 


.45 


-.20 


5,6 


.34 


.45 


-.11 


5,6 


.23 


.45 


-.23 


7,8 


.29 


.41 


-.12 


7.8 


.18 


.41 


-.23 


9,10 


.18 


.27 


-.10 


9, 10 


.10 


.27 


-.17 


Average 


.30 


.39 


-.09 


Average 


.20 


.39 


-.19 




Unbiased Sets (3 


X 10 Items) 




Unbiased Sets (2 x 10 Items) 


1.2 


.07 


.00 


.07 


1.2 


.15 


.00 


.15 


3,4 


.10 


.00 


.10 


3,4 


.20 


.00 


.20 


5.6 


.11 


.00 


.11 


5.6 


.23 


.00 


.23 


7,8 


.12 


.00 


.12 


7.8 


.23 


.00 


.23 


9,10 


.10 


.00 


.10 


9.10 


.17 


.00 


.17 


Average 


.10 


.00 


.10 


Average 


.20 


.00 


.20 


75% Bias 


100% Bias 




Biased Set 


(3 X 10 Items) 




Biased Sets 12 x 


10 Items) 


Calculated 


True 




Calculated 


True 




Item 


Bias 


Bias 


Difference 


Item 


Bias 


Bias 


Difference 


1.2 


.13 


.39 


-.26 


1.2 


00 


.39 


-.39 


3,4 


.13 


.45 


-.32 


3.4 


.00 


.45 


-.45 


5.6 


.11 


.45 


-.34 


5,6 


.00 


.45 


-.45 


7,8 


.09 


.41 


-.32 


7,8 


.00 


.41 


-.41 


9,10 


.05 


.27 


-.23 


9,10 


.00 


.27 


-.27 


Average 


.10 


.39 


-.29 


Average 


.00 


.39 


-.39 




Unbiased 


Set (10 Items) 










1.2 


.26 


.00 


.26 










3,4 


.32 


.00 


.32 










5,6 


.34 


.00 


.34 










7,8 


.32 


.00 


.32 










9,10 


.23 


.00 


.23 










Average 


.29 


.00 


.29 
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TABLE 2 

Wordlne for Eight Posttest Algebra Core Iteas 



If 5x + 4 = 4x - 31, 
then X Is equal to 



A 
B 

C 
D 
E 



-35 
-27 
3 
27 
35 



If P = LW and If P = 12 

and L - 3, then W Is equal to 



A 
B 

C 
D 
E 



3/4 
3 
4 
12 

36 



3. (-2) X (-3) Is equal to 



6. A shopkeeper has x kg 
of tea In stock. He 
sells 15 kg and then 
receives a new lot 
weighing 2y kg. What 
weight of tea does he 
now have? 

A X - 15 - 2y 

B X + 15 + 2y 

C X - 15 + 2y 

D X + 15 - 2y 

E None of these 

7. The table below coapares 
the height froa which a 
ball Is dropped (d) and 
the height to which It 
bounces (b). 



B 


-5 












C 
D 


-1 
5 


d 


50 


80 


100 


150 


E 


C 












4. If 


4x/12 = 0, then x Is equal to 


b 


25 


40 


50 


75 



A 0 

B 3 

C 8 

D 12 

E 16 

5. The air teaperature at 
foot of a nountaln 
Is 31 degrees. On top 
of the Bountaln che 
temperature Is -7 
degrees . How much 
Warner Is the air at 
the foot of the aountaln? 



Nhlch foraula describes 
this relationship? 



A 


b 


^ d2 




B 


b 


= 2d 




C 


b 


-= d/2 




D 


b 


= d + 


25 


E 


b 


=■ d - 


25 



8. The sentence "a number x 
decreased by 6 Is less than 
12*' can be written as the 
Inequality 



A -38 regrees 

B -24 degrees 

C 7 degrees 

D 24 degrees 

E 38 degrees 



A 


X 


- 6 > 


12 


B 


X 


- 6 > 


12 


C 


X 


- 6 < 


12 


D 


6 


- X > 


12 


E 


6 


- X < 


12 
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TABLE 3 

Item Bias Values for Various SINS Models 



Item 




Traditional 




New 




Model 


I 


Model 


II 


Model III 




OTL 


Class 


OTL 


Class 




1 


.06 




. UD 


.06 


.15 


2 


.17 


1 2 


1 7 


.12 


.11 


3 


.13 


AC 


. 13 


.17 


.30 


1 


.26 


.19 


.25 


.18 


.••3 


5 


.19 


.20 


.22 


.21 


.02 


6 


.23 


.22 


.22 


.22 


.01 


7 


.22 


.18 


.23 


.18 


.01 


8 


.21 


.25 


.21 


.24 


.03 
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TABLE 4 
Estlaates froa Model III 
Neasureaent Paraneters 



Itejn T A 0 

Est. 1 -ratio Est. t-ratlo Est. t-ratlo 

1 1.45 22.31 0.77 17.50 0.33 6.22 

2 0.07 0.97 0.86 23.43 0.22 3.72 

3 0.93 10.79 1.00 0.00 0.64 8.93 

4 0.66 9.51 0.86 23.19 0.28 5.46 

5 0.52 6.41 0.85 22.93 -0.03 -0.49 

6 0.31 4.78 0.94 24.36 -0.03 -0.65 

7 0.60 11.53 0.67 18.04 -0.02 -0.38 

8 0.23 3.87 0.80 21.08 0.05 1.12 

y ^ 

X variable Est. t-ratlo Est. t-ratlo 

0.43 18.18 

1 0.20 6.09 

2 -0.15 -3.28 

3 -0.01 -0.22 

4 0.03 0.72 

5 0.13 2.22 

6 0.23 7.41 

7 0.17 6.12 

8 0.33 9.28 
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